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Abstract. Let / be a multivariate density and /„ be a kernel estimate of / 
drawn from the n-sample Xi, • • • , A„ of i.i.d. random variables with density 
/. We compute the asymptotic rate of convergence towards of the volume 
of the symmetric difference between the t-level set {/ > t} and its plug-in 
estimator {/„ > t}. As a corollary, we obtain the exact rate of convergence 
of a plug-in type estimate of the density level set corresponding to a fixed 
probability for the law induced by /. 
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1. Introduction. Recent years have witnessed an increasing interest in esti- 
mation of density level sets and in related multivariate mappings problems. 
The main reason is the recent advent of powerfull mathematical tools and 
computational machinery that render these problems much more tractable. 
One of the most powerful application of density level sets estimation is in 
unsupervised cluster analysis (see Hartigan [1]), where one tries to break a 
complex data set into a series of piecewise similar groups or structures, each 
of which may then be regarded as a separate class of data, thus reducing 
overall data compexity. But there are many other fields where the knowl- 
edge of density level sets is of great interest. For example, Devroye and Wise 
[2], Grenander [3], Cuevas [4] and Cuevas and Fraiman [5] used density sup- 
port estimation for pattern recognition and for detection of the abnormal 
behavior of a system. 

In this paper, we consider the problem of estimating the i-level set C{t) 
of a multivariate probability density / with support in from independent 
random variables Xi, - ■ ■ , Xn with density /. Recall that for t > 0, the t-level 
set of the density / is defined as follows : 

C{t) = {xeIR^ : f{x) > t}. 
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The question now is how to define the estimates of (t) from the n-sample 
Xi, - ■ ■ ,Xn ? Even in a nonparametric framework, there are many possible 
answers to this question, depending on the restrictions one can impose on 
the level set and the density under study. Mainly, there are two families of 
such estimators : the plug-in estimators and the estimators constructed by 
an excess mass approach. Assume that an estimator /„ of the density / is 
available. Then a straightforward estimator of the level set (t) is {/„ > t}, 
the plug-in estimator. Molchanov [6, 7] and Cuevas and Fraiman [5] proved 
consistency of these estimators and obtained some rates of convergence. The 
excess mass approach suggest to first consider the empirical mapping M„ 
defined for every borel set L C M'' by 

1 " 

Tl . - 
1=1 

where A denotes the Lebesgue measure on M'^. A natural estimator of (t) is a 
maximizer of Mn{L) over a given class of borel sets L. For different classes of 
level sets (mainly star-shaped or convex level sets), estimators based on the 
excess mass approach were studied by Hartigan [8], Miiller [9], Miiller and 
Sawitzki [10], Nolan [11] and Polonik [12], who proved consistency and found 
certain rates of convergence. When the level set is star-shaped, Tsybakov [13] 
recently proved that the excess mass approach gives estimators with opti- 
mal rates of convergence in an asymptotically minimax sense, whithin the 
studied classes of densities. Though this result has a great theoretical in- 
terest, assuming the level set to be convex or star-shaped appears to be 
somewhat unsatisfactory for the statistical applications. Indeed, such an as- 
sumption does not permit to consider the important case where the density 
under study is multimodal with a finite number of modes, and hence the 
results can not be applied to cluster analysis in particular. In comparison, 
the plug-in estimators do not care about the specific shape of the level set. 
Moreover, another advantage of the plug-in approach is that it leads to eas- 
ily computable estimators. We emphasize that, if the excess mass approach 
often gives estimators with optimal rates of convergence, the complexity of 
the computational algorithm of such an estimator is high, due to the pres- 
ence of the maximizing step (see the computational algorithm proposed by 
Hartigan, [8]). 

In this paper, we study a plug-in type estimator of the density level set 
(t), using a kernel density estimate of / (Rosenblatt, [14]). Given a kernel 
K on M'' {i.e., a probability density on JR^) and a bandwidth h = h{n) > 
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such that /i — ^ as n grows to infinity, the kernel estimate of / is given by 



1=1 

We let the plug-in estimate n{t) of (t) be defined as 

n{t) = {xelR'' : fn{x) > t}. 

In the whole paper, the distance between two borel sets in is a mea- 
sure -in particular the volume or Lebesgue measure A on M^- of the sym- 
metric difference denoted A (i.e., AAB = n B"") U (A" D B) for all sets 
A,B). Our main result (Theorem 2.1) deals with the limit law of 

V^A(„(t)A(t)), 

which is proved to be degenerate. 

Consider now the following statistical problem. In cluster analysis for 
instance, it is of interest to estimate the density level set corresponding to 
a fixed probability p G [0, 1] for the law induced by /. The data contained 
in this level set can then be regarded as the most important data if p is 
far enough from 0. Since / is unknown, the level t of this density level set 
is unknown as well. The natural estimate of the target density level set {t) 
becomes n{tn), where tn is such that 

/ fndX = p. 

n (in ) 

As a consequence of our main result, we obtain in Corollary 2.1 the exact 
asymptotic rate of convergence of to (t). More precisely, we prove that 
for some /?„ which only depends on the data, one has : 



in probability. 

The precise formulations of Theorem 2.1 and Corollary 2.1 are given in 
Section 2. Section 3 is devoted to the proof of Theorem 2.1 while the proof of 
Corollary 2.1 is given in Section 4. The appendix is dedicated to a change of 
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variables formula involving the (A;-l)-dimensional Hausdorff measure (Propo- 
sition A). 



2. The main results. 

2.1 Estimation of t-level sets. In the following, C (0,cx)) denotes an 
open interval and ||.|| stands for the euclidean norm over any finite dimen- 
sional space. Let us introduce the hypotheses on the density / : 

HI. / is twice continuously differentiable and f{x) — as — ^ oo ; 

H2. For all t e 9, 

inf IIV/II > 0, 

where, here and in the following, Vip{x) denotes the gradient at x G M'^ of 
the differentiable function ijj : IR!' — > IR. Next, we introduce the assumptions 
on the kernel K : 

H3. if is a continuously differentiable and compactly supported func- 
tion. Moreover, there exists a monotone nonincreasing function : 
M+^M such that K{x) = ^l{\\x\\) for all x G . 
The assumption on the support of K is only provided for simplicity of the 
proofs. As a matter of fact, one could consider a more general class of kernels, 
including the gaussian kernel for instance. Moreover, as we will use Pollard's 
results [15], K is assumed to be of the form /x(||.||). 

Throughout the paper, Ti denotes the (/c-l)-dimensional Hausdorff mea- 
sure on [cf. Evans and Gariepy, [16]). Recall that Ti. agrees with ordinary 
"(A;-l)-dimensional surface area" on nice sets. Moreover, dA is the boundary 
of the set Ac M'', 

'3 if A; = 1 ; 
k + A ifk>2. 



a{k) 



and for any bounded borel function g : — ^ \g stands for the measure 
defined for each borel set A C IRf^ by 



{A)= j gdX. 
J A 



p 

Finally, the notation — > denotes the convergence in probability. 

It can be proved that if HI, H3 hold and if X{d{t)) = 0, one has : 

X(nit)A{t)) ^ 0. 



The aim of Theorem 2.1 below is to obtain the exact rate of convergence. 

Theorem 2.1. Let g : ]Rj^ he a hounded horel function and assume 

that H1-H3 hold. If nh'' / {log nf^ oo and nh'^^^Xlognf 0, then for 
almost every (a.e.) t E @ : 



Remarks 2.1. • Notice that the rightmost integral is defined because g is 
bounded and {t) is a compact set for alH > according to HI. 

• In practice, this result is mainly interesting when 5=1, since we then have 
the asymptotic behavior of the volume of the symmetric difference between 
the two level sets. The general case is provided for the proof of Corollary 2.1 
below. 

• If we only assume / to be Lipschitz instead of HI, then / is an almost 
everywhere continuously differentiable function by Rademacher's theorem 
and Theorem 2.1 holds under the additional assumption on the bandwidth : 
nh'^^'^ilognf —>■ 0. 

2.2 Estimation of level sets with fixed probability. In order to derive 
the corollary, we need an additional condition on /. 

H4. For all t G (0,supjRfe /], X{f-^[t- e,t + e]) ^ as e ^ 0. Moreover, 
X{f-\0,e]) ^Oase^O. 
Roughly speaking, H4 means that the sets where / is constant do not charge 
the Lebesgue measure on M'^. Many densities with a finite number of local 
extrema satisfy H4. However, notice that if / is a continuous density such 
that A(/~^(0, e]) — >■ as e — >■ 0, then it is compactly supported. 

Let us now denote by V the application 

[0,supjifc/] ^[0,1] 
t ^ Xfiit)). 

Observe that V is one-to-one if / satisfies HI, H4. Then, for all p G [0, 1], 
let G [0, supjRfc /] be the unique real number such that Xf{(t^'))) = p. 

Morevover, let tn^ G [0,supjjfc /„] be such that Xf^{n{tn^)) = p. Notice that 

tn^ does exists since is a density on M^. 
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The aim of Corollary 2.1 below is to obtain the exact rate of convergence 
of n{tn) to (t). We also introduce an estimator of the unknown integral in 
Theorem 2.1. 

Corollary 2.1. Let k > 2, (q;„)„ be a sequence of positive real numbers 
such that an and assume that H1-H4 hold. If nh^~^'^ / log n — >■ oo, 
nh'^^'^ {log n)'^ 



and a^nh^ /{log ny oo then, for a.e. p G ^(6) ; 



(.(t(r))A(t(-))) ^J^J KHX, 



where Pn = an/Hn{tn^) - n{tn' + an))- 

Remarks 2.2. • It is of statistical interest to mention the fact that under 
the assumptions of the corollary, we have for all p G [0, 1] : t^^^ t^P^ with 
probability 1 (see Lemma 4.3). 

• When k = 1, the conditions of Theorem 2.1 on the bandwidth h do not 
permit to derive Corollary 2.1. In practice, estimations of density level sets 
and their applications to cluster analysis for instance are mainly interesting 
in high-dimensional problems. 

3. Proof of Theorem 2.1. 

3.1. Auxiliary results and proof of Theorem 2.1. For all t > 0, let 

(logn)^-' 



K = r 



^ _ (log n)^ ^ 



and Vl = f-^ 



t,t + 



where (3 > 1/2 is fixed. Moreover, K stands for the real number : 

k = J K^dX. 



Proposition 3.1. Let g : JR^ — > iR+ be a bounded borel function and assume 
that H1-H3 hold. If nh'' /{log n)^^f^ oo and n/i'^W (log n)^^ ^ 0, then for 
a.e. t eO : 

limVn^ / P{fn{x) > t)dXg{x) = limVnh^ f P{f„{x) < t)dXg{x) 

" Jvt n Jy* 




27r Jdit) IIV/II 



dn. 
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Proposition 3.2. Let g : M ]Rj^ he a hounded horel function and assume 



that H1-H3 hold. If nh'' / {log nf<^ oo and n/i"^'') (log n 
a.e. t eO : 



2/3 



0, then for 



lim nh var 

n 



= lim n/i var 

n 



A,(vln„(i)' 



Proof of Theorem 2.1. Let t G G be such that both conclusions of Propo- 
sitions 3.1 and 3.2 hold. According to H3 and Pollard ([15], Theorem 37 and 
Problem 28, Chapter II), we have almost surely (a.s.) : 

SUp|/n--B/„| ^0. 

Moreover, since both sup„£^/„(x) and f{x) vanish as — ^ oo by HI, H3, 
we have : 

sup|E/„-/| ^0. 
Thus, a.s. and for n large enough : 

SUp|/n - /I < \. 

Consequently, „(i) C (t/2) and since (t) C (t/2), we get : 
Let 

= {V^sup|/„-/| < (logn)^}. 

Since (i/2) is a compact set by HI, it is a classical exercise to prove that 
P{A^ — ^ 1 under the assumptions of the theorem. Hence, one only needs to 
prove that the result of Theorem 2.1 holds on the CA^ent A^. But on one 
has according to (3.1) : Ag(n(t)A(i)) = + J^, where : 

Jl = \^\^n{tr) and J^ = A,(v*n„(t)). 
By Propositions 3.1 and 3.2, if j = 1 or j = 2 : 




}d{t) IIV/II 



dU, (3.2) 
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if the bandwidth h satisfies n^"('^)(logn)^'' — and nh'^ /{log n)^^^ oo. 
Letting (3 = 16/31, the theorem is proved • 



3.2. Proof of Proposition 3.1. Let X be a random variable with density 
/, 

Vn{x) = vari^(^-^) and = -^=(/„(x) - EUix)), 

for all X G such that Vn{x) ^ 0. Moreover, $ denotes the distribution 
function of the M(0, 1) law. 

In the proofs, c denotes a positive constant whose value may vary from 
line to line. 

Lemma 3.1. Assume that HI, H3 hold and let C C be a compact set 
such that infc / > 0. Then, there exists c > such that for all n > 1, x E C 
and u E ]R : 

\P{Zn{x) <U)- < 



Vnh^ 



Proof. By the Berry-Esseen inequality (c/. Feller, [17]), one has for all n > 1, 
ue IR and x e M'^ such that Vn{x) : 



|P(Z„(x) < u) - Hu)l < ^^e\k{^) - EK(^) 



'X- 



^/nVnix)--" 

It is a classical exercise to deduce from HI, H3 that 

'X-X\ „„/x-Xn 3 



sup^ 



<ch'' and inf Vn{x) >ch'^, 



hence the lemma • 



For all borel bounded function g : — we let 60(5) to be the set 
of t € such that : 

hmiA4/-'(*-M0 =limiA,(/->[M + .l) =X^,^i|^<«. 



Lemma 3.2. Let g : IR^ — > he a borel bounded function and assume 
that HI, H2 hold. Then we have : @o{g) = @ a.e. 
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Proof. According to HI, H2, for all t E Q, there exists rj > such that : 

inf IIV/II > 0. 

We deduce from Proposition A that for alH G and £ > small enough : 
iA,(r'[t-Ml) = J£X^^,^<(«ci.. 

Using the Lebesgue-Besicovitch theorem (c/. Evans and Gariepy, [16], The- 
orem 1, Chapter I), we then have for a.e. t E Q : 

limiA,(r'l*-M])=4^^*i, 

and the same result holds for Xg{f~^[t,t + e]) instead of Ag(/~^[t — e,t]), 
hence the lemma • 

It is a straightforward consequence of Lemma 3.2 above that X{d{t)) = 
for a.e. i G G. For simplicity, we shall assume throughout that this is true 
for all t E @. Since is an open interval, we have in particular 

\(f-\t -e,t + e])= X(f-\t -e,t + e)), 

for alH G G and £ > small enough. 

We now let for t G G and x G iR^ such that f{x)Vn{x) / : 



tn{x) 



n 



■{t- f{x)) and tn{x) 



Kf{x)'' VVnix) 

and finally, $(n) = 1 — ^{u) for all u e R. 



it-EU{x)), 



Lemma 3.3. Let g : IRf^ he a hounded horel function and assume 

that HI, H2 hold. If nh^ / {log nfl^ oo and nh''+'^ {log nf^ 0, then for 
all t G &o{9) ■■ 



limV^[J^^ P{fn{x) > t)d\g{x) - j^^ ^{tn{x))d\g{x) 

and lim / P{fn{x) < t)d\g{x) - A ^{tn{x))d\g{x) 





:0. 
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Proof. We only prove the first equality. Let t G Qoid)- First note that for 
all xeM'' such that ^(x) ^ : 

P{fn{x) >t)= P{Zn{x) > tn{x)). 

There exists a compact set C C such that infc / > and C C for all 
n. Observe that by Lemma 3.1 and the above remarks, 

f P{U{X) > t)dXg{x) - f ^tn{x))dXg{x)\ < CXgiVi). 



Since Ag(V^) ^ by Lemma 3.2, one only needs now to prove that : 

En := [ Mtn{x)) - ^tn{x))\dXg{x) ^ 0. 

Jvi. 



One deduces from the Lipschitz property of $ that 

En < cVn^Ag(V*) sup \tn{x) - tn{x)\. (3.3) 

But, by definitions of tn{x) and tn{x), we have for all x EV^ : 
1 



Vnh^ 

< (\t-fix)\ 



1 



^/Kfix) ^Vn{,x)h-^ 



+ 



Vn{x) 



\Efnix) - fix) 



< 



\kf{x) - Vn{x)h-^\ 



(log n)f^ 

, VnP \ kf{x)Vn{x)h-^ ^ V Mx) 



■\Efn{x)-f{x)\\. (3.4) 



It is a classical exercise to deduce from HI, H3 that, since is contained 
in C, 

sup \Efn{x)- f{x)\<ch\ 



and similarly, that 



sup \kf{x) - Vn{x)h-''\ <ch. 



One deduces from (3.4) and above that 



sup \tn{x) - tn{x)\ < c {Vh {log nf + VnhF^) . 
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Thus, by (3.3) and since t G Qo{g), one has for all n large enough : 

En<c (log nf{Vh (log nf + Vnh^), 

and the latter term vanishes by assumptions on h, hence the lemma • 

Proof of Proposition 3.1. By Lemma 3.2, one only needs to prove Propo- 
sition 3.1 for all t G 60(5)- Fix t G Qoig), and let 

/„ := j^^ '^{tn{x))dXg{x) and 7„ := ^{tnix))dXg{x). 
By Lemma 3.3, the task is now to prove that 



lim V nh^ I„ = \/4 — [ dH = lim V nh^ In- 

Jdit) V/ 



We only show the first equality. One has 



In = , ^ / / exp ( - —^jdudXgix), 



where for all a; G iR^ such that f{x) > 0, 6„(x) = Vnf^it - f{x))/f{xf/'^. 
By Fubini's theorem : 



1 1"^ ^ \ ( I (logn)^ / u \^\ ]\ 
In = ^==\ cxp --^ Agl/ maxft ^=^,x(^= ,t du, 



u ^2^ 



where for all u > 0, x(i;) = — v/2 + (l/2)\/u^~+4t. It is straightforward to 
prove the equivalence : 



u 



(log n)^ 



where r„ = (logn)^ / y t ~ (log n)l^{nh'') ^Z^, so that one can split In into 
two terms, i.e., In = In + In^ where 

,2 



1= re.p(-^)Xg(f-'\x(^Y,t 



J roo , ^2 

and/2 = —j==\ exp( - -^jAJ/-^ 



^ (log ^ 



^du. 
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Since t G @o{g), one has for all n large enough : 



VW ll<c (log n)^ / exp ( ^ j du, (3.5) 

and the rightmost term vanishes. Thus, it remains to compute the limit of 
Vnh^In- Using an expansion of % in a neighborhood of the origin, we get 



) =uVi [ 



d(t) liv/ii 



dH, (3.6) 



for all It > 0, since t € Qoid)- Moreover, one deduces from Lemma 3.2 that 
for all n large enough and for all u G [0, rn] : 



X 



u 



)\t]) < cV^(i-x(^)') 



because Tn/VnhF 
theorem : 

lim VnhF I„ 



< cu, (3.7) 
0. Thus, according to (3.5)-(3.7) and the Lebesgue 

lim Vnh^ ll 
n " 

[ exp —\u\/i [ „,, dHdu 



V27rK 

Itk 



Jait) IIV/II 



dH, 



hence the proposition 



3.3. Proof of Proposition 3.2. Prom now on, we introduce two random 
variables A^i, N2 with law A/'(0, 1) such that Ni,N2,Xi,X2, ■ ■ ■ are indepen- 
dent. We let 

1 

-, Vn > 2. 



(log log log n ' 

(As we will see later, the random variable Zn{x) + (7„7Vi -for instance- has a 
density with respect to the Lebesgue measure.) For simplicity, we assume in 
the following that under H3, the support of K is contained in the euclidean 
unit ball of M''. 

Lemma 3.4. Let g : IR^ iR+ he a borel bounded function and assume 
that H2 holds. If nh^ / (log n)"^^ —>■ 00, then for all t G 00(5) there exists 
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c > such that for n large enough : 

I P{[Zn{x) > tn{x)}^[Zn{x) + a^Ni > tn{x)})dXg{x) < CWn, 

and P{^^Zn{x) < A|z„(a;) + cTniVi < tnix)^^dXg{x) < cwn, 

where Wn = {\og n)^ / {nh^) + cr„(log /\/nh^ . 

Proof. We only prove the first inequality. Let t G Q{j{g) and 

Pn := / P{{Zn{x) > t„(x)}A{Z„(x) +(7„7Vi > t^{x)])d\g{x). 
By independence of Ni and Zn(x), Pn is smaller than 

j^^ j e^^(^-^)pi^[Zn{x)>tn{x)]i^[Zn{x) + anZ>tn{x)])dzdXg{x), 



/ exp f - - 

and consequently, 



Pn< j^J exp ( - -?„(x)| < an\z\)dzdXg{x). 

Since t G 60(5), one deduces from Lemma 3.1 that for n large enough : 

j^^ y"exp ( - ^)p{^Ni-tn{x)\ < an\z\)dzdXgix) 



{\ognY cj„(logn)^ 

< C -T h 



hence the lemma • 

Lemma 3.5. Fix t E & and assume that HI, H3 hold. Then, there exists a 
polynomial function Q of degree 5 defined on IP? such that for all (^1,^2) G 
iR^ and n large enough : 

Eexp (j,(uiZn{x) + U2Zn{yfj^ - Eexp (iuiZn{x)^ E exp (iu2Zn{yfj 

Q{\ui\,\u2\) 



< 



if x,y &Vn^ vli are such that \\x — y\\ > 2h. 
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Proof. First of all, fix ui, ^2 G iR, y G U V„ and consider the following 
quantities : 



Ml 
and M2 



Ul 



'X-X^ 



One deduces from the inequality | ex.p{iw) — 1 — iw + w'^ /2\ < \w\ Vii; G JR 
that 

£;exp (z(Mi + M2)) - 1 + -E{Mi + Ms)^ 



exp (z(Mi + M2)) - l-i(Mi +M2) + ^(Mi + M2 



< £;|Mi + M2|''. 



In a similar fashion, if j = 1 or j = 2 : 



Eexp{iMj) - 1 + ^EMf 



Consequently, 



E 



exp(zM,) - 1 - iMj + -Mf < E\Mj f. 



Eexp (i^Mi + M2)) - Eexp (iMi^Eexp (iM^J 



< E\Mi+M2r + 
+ 



1 - -E\Mi +M2I 



1 - l-EMt ] ( 1 



-EMo 



1 - -EM? 
2 ^ 



E|M2|'^ + 1 - -£;M| E|Mi|^ (3.8) 



It is an easy exercice to prove that for all n large enough, one has inf V^(x) > 
ch^, the infinimum being taken over all x G U V„. Consequently, if ,7 = 1 
or _7 = 2 : 



E\Mj\^ < c 



from which we deduce that 



£:|Mi + M2P < c 



Moreover, EMl = u\/n, EM2 = u^/n and for all x, y G V* U such that 
— y\\ > 2/i : 



E{Mi + M2)^ = EM^ + EMl - 



U1U2 



nVVn{x)Vn{y) 



EK 



x-X 
h 
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because the support of K is contained in the unit ball and hence 

One deduces from above and (3.8) that for all x,y G U v|j such that 
Ik -2/11 > 2/i : 



< c 



+ 



(U1U2) 



Eexp (iMi]Ecxp (iM2 



+ c 



1^2^(1 + + |mp(i + tti) 



+ c 



By assumption, nh^'^ — > so that for n large enough : < l/VnhF. Con- 
sequently, 



< 



Qi\ui\, \U2\ 



Vnh^ 



Eexp (i(^Mi + - £;exp (iMi^Eexp (^iM2^ 

where Q is defined for all ui , U2 G by : 

Q{ui,U2) = c{ul + ul + {uiU2)'^ + U1U2 + uluf + uful). 
Consequently, for all ui,U2 G M and x,y G V^UV^ such that — y|| > 2h : 
Eexp (^i(uiZn{x) + U2Zn{y)^^ - Eexp (j,uiZn{x)jEexp {iu2Z, 



< n 



(Eexp (i(^Mi + M2^)Y - (Eexp (iMi^Eexp (iM2)y 
£^exp (i(^Mi + M2)) - -Eexp (iMi^Eexp (iM2^ 



< 



Q{\ui\, \U2\ 



hence the lemma • 

In the following, uv stands for the usual scalar product of u, v G iR^. 

Lemma 3.6. Letx,y G be such that Vn{x)Vn{y) / 0. Then, the bivariate 
random variable 

( Zn{x)+anNi \ 
\ Z^{y)+anN2 



has a density (^^'^ defined for all u G ]B? by 



1 

4^ 



E 



exp (i(yiZn{x) + V2Zn{y 
15 



exp (^ — iuv (T^||f Ip^df . 



Proof. By independence of Xi, • • • , X„, A'^i and A''2, the random variable 



Zn{x) 

Zn{y) 



N2 



has a density ip^'^ defined for all u = {ui,U2) G by 



E 



exp 



( 



(^1 - Znix)f 



) exp ( 



{u2 - Zn{y))' 



Using the equality 



V2^ — n 

we deduce from the Fubini theorem that 

"y^n^W = ^ / E e-^^{i{viZn{x) + V2Zn{y)y^ e-x.-g(^-iuv--al\\v\\'^^dv, 
hence the lemma • 

Proof of Proposition 3.2. We only prove the first equality of Proposition 
3.2. According to Lemma 3.2, one only needs to prove the result for each 
t G 60(5)- Hence we fix i G 60(5) and we put : 

An{x) = {Zn{x) > tnix)}, A^ix) = [Znix) + anNj > tn{x)} , j = 1,2, 

for all X G iR*^ such that V^(x) 7^ 0. First note that since the events 
and {fn{x) > t} are equal, one has 



= j^^^^^^(p{An{x)nAn{y))-P{An{x))P{An{y)))dXf{x,y). (3.9) 

But, by Lemma 3.4 and since t G 60(5), one has for all n large enough : 
nh'^ [ (p{An{x)^An{y))-P{Ai{x)^Al{y)))dXf{x,y) 

< 2nh}'\g{Vl) I P{Ar,{x)AAl(x))d\g{x) 

< c(logn)/^V;^(i^ + ^^) 



< c 
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and the latter term tends to by assumption. In a similar fashion, one can 
prove that 

By the above results and (3.9), it remains to show that 

"^'/v*)x2 {nAl{x)nAl{y))-P{Al{x))P{Al{y)))dXf{x,y) ^ 0. (3.10) 

Let T{h) = {ix,y) £ {IR^Y'^ : ||x - y\\ < 2h}. According to the Pubini 
theorem, 



nh''Xf(^{V'X^nT{h)) = nh'' IXg(yinB{x,2h))d\g{x) 

< nh^ [ Xg{B{x,2h))dXg{x), 

where B{z,r) stands for the euclidean closed ball with center at z £ IR^ and 
radius r > 0. Since t G ©0(5), one deduces that 



< c ^ nh^^ {log , 
so that, by assumption on the bandwidth h : 

\imnh^Xf({ViY^ n TQi)) = 0. 

Let now Sn = (V^)^^ nT(hy'. According to (3.10) and the above result, one 
only needs now to prove that : 

nh'' [ {p{Ai{x)nAl{y))-P{Al{x))P{Al{y)))dXf{x,y)^(}. (3.11) 

By Lemmas 3.5 and 3.6, one has for all x^y E Sn : 
P{Al{x) n Aliy)) - P{Ai{x))P{Al{y)) 

j E exT£> {i{uiZn{x) + u-2Zn{y)y^ 

—Eex]) (^miZ„(x)^£'cxp {iAi2Zn{y)^ exp - ^a1\\u\\^^duidu2 



< 



< 
< 



1 /" 1 

== / Q(|'"iU'"2|)exp (- -(7^11^11^ )dui(iu2 



Vnh^ 

c 



17 



where Q is the polynomial function defined in Lemma 3.5. Consequently, one 
has for all n large enough : 

nfc' f {P(AI[X) n Alin)) - P{Ai{x))P(Al{y)))d\f{x, y) 



(logn)^'^ 

< c , 

cr^V nh'^ 

which tends to by assumption, hence (3.11) • 
4. Proof of Corollary 2.1. 

Lemma 4.1. Let k>2 and assume that H1-H3 hold. If nh^'^^{}ognf 
and nh'^/{logn)^^ — oo, then for a.e. t E @ : 



Proof. Let t G be such that the conclusion of Theorem 2.1 holds both for 
g = f and g = 1. Notice that 

A/„((i))-A/„(n(i)) = J fn{Mf>t}-l{f„>t})dX 
As in the proof of Theorem 2.1, we see that the result of the lemma will hold 

/ P 

if we show that v nh^Kn — 0, where 

Kn ■■= L fn'i-{U<t}dX - fnl{U>t}d\. 

Split Kn into four terms as follows : 

Kn = Lifn- f)l{U<t}dX- J ifn- fn{U>t}dX 

+ Hu<t}dXj - j^^ !{/„>*} dA^. (4.1) 
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On one hand, it is a classical exercise to deduce from HI, H3 that 

sup|/„ - /I ^ 0. 

Thus, using (3.2), 

L{fn-f)l{u<t}dX^O. 

In a similar fashion : 

On the other hand, we get from (3.2) that : 

where the limits are in probability. By the above results and (4.1), V nh^Kn 
tends to in probability, hence the lemma • 

Lemma 4.2. Let k > 2, t e Q and assume that HI, H3 hold. Ifnh^^'^ — > 0, 
then : 

Proof. Observe that 

A/((i))-A/„ ((*))= / {f-EU)dX+ [ {EU-fn)d\. 
J{t) J{t) 

According to HI, H3, we have : 

/ \f-EU\dX<ch^ 
J{t) 

and since nh'^'^^ — ^ 0, we only need to prove that 

[ {EU-fn)dX^O. 
J it) 
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We prove that this convergence holds in quadratic mean. We have : 



dxdy. 



Recall that we assume in Section 3.3 that the support of K is contained in 
the unit ball so that if \\x — y\\ > 2h, 

Letting R{h) = {{x,y) G {t)^^ : \\x — y\\ < 2h}, one deduces from above 
that 



E(yn}^ j^^^{Efn- fn)d\f < ^ J^^^J K[^j^)f{u)dudxdy 

< c / K{v)f{x — hv)dvdxdy 
Jr(K) J 



IR{h) . 

< cX^\R{h)) 



< 



'(*) 

according to the Pubini theorem. Thus, we get : 



c j x(^{t)nB{x,2h)^dx, 



E(y^ j^^i^Efr, - fn)dxf < ch\ 



hence the lemma 



Lemma 4.3. Let p G [0, 1] and assume that HI, H3 and H4 hold. If 
nh'^/logn oo, then tn^ — > t'^^^ a.s. 

Proof. Let t = t^P^ and t„ = t^^\ As seen in the proof of Theorem 2.1, 
supjfjfc \ fn — /I — a.s. Hence, one can fix 



a;G|sup|/„-/|^o|. 



For notational convenience, we omit lo until the end of this proof. Since / is 
bounded, one has sup^supjfjfc /„ < oo and consequently sup„t„ < oo. Thus, 
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from each sequence of integers, one can extract a subsequence {nk)k such 
that tn^. — ^ t*. On one hand, according to Scheffe's theorem, 

lim(Aj„^U(tnJ)-A/U(t„J)) =0, (4.2) 
since both / and are density functions on M'' and 

^/n,(nfe(*nj) - A/(n,(i„J)| < j {fn^ - f\dX. 

On the other hand, letting = supj^k \fnk ~ /I) one observes that 



^fiituk)) - -^/(nfe(*nj) 



£fc</<W+£fc} 



+ £fc]n(o,sup/])) 



and the latter term tends to as A; ^ oo under H4 (consider separately the 
two cases : t* = and t* > 0). One deduces from (4.2) that : 

lim(AK(t))-AK(tnJ)) = lim{p-Xf{itnk))) 

= (-^/n, Uk (in J) - A/(nfc (tn J)) 

+ lim(A/(„,(t„J)-A/((t„J)) 



0. 



(4.3) 



Moreover, the appHcation .s Ay((s)) defined on [0,supjRfc /] is continuous 
according to H4. Consequently, one has 

and thus, by (4.3), Xf{{t)) = Xf{{t*)) and hence t = t* because V is one- 
to-one. One conclude t„ t since we proved that from each sequence of 
integers, one can extract a subsequence {nk)k such that — > t. The lemma 
is proved • 



Lemma 4.4. Let k > 2 and assume that H1-H4 hold. //ra/i^+^(log 
and n/i'^"'"^/ log n — > oo, then for a.e. p G V{Q) : 



n] 







^fnhF 



t(p) 



(p) 



dn{s) ||V/„ 



^ dUds^ 0. 
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Proof. One only needs to choose p G 'P(B) such that the conclusion of 
Lemma 4.1 holds for t^^K For simplicity, let t = t^^^ and tn = tn \ It is a 
classical exercise to prove that since nh'^'^'^ / log n — > oo and nh''+'^ 0, 

l|V/„|| - IIV/II a.s., 

uniformly over the compact sets. Thus, by Lemma 4.3 and H2, we have a.s. 
and for n large enough : 

inf ||V/„|| > 0. (4.4) 

/ ^[mhi{t„,t),max{t„,t)] 

We deduce from Proposition A that a.s. and for n large enough : 

-^/nUC^n)) - A/„(„(t)) = j {}{fn>tn}-'^Un>t})d^fn 



'^{tn<fn<t}d>^fn 



fn 



-dH ds, 



'dr,{s) ||V/„| 

where the latter integral is defined according to (4.4). Consequently, 



A/Jn(in))-A/„(n(i)) 



Lin(t„,t) Jdn{s) ||V/, 



rdH ds. 



By Lemma 4.3, one has a.s. and for n large enough : tn > t/2. Since 
A/„(n(*n)) = p = Xf{{t)), one deduces that : 

A/((«))-A/„(„(t)) >- / ^^dUds. 

^ Jmin{tn,t) Jd„{s) II V/„|| 

We can now conclude the proof of the lemma because 



by Lemmas 4.1 and 4.2 • 



A/((0)-A/„(n(i)) 



0, 



Lemma 4.5 Assume that m m hold. If nh''/ {log n)^ — > oo, then for a.e. 



logn 



tip) 
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Proof. By H2 and the Lebesgue-Besicovitch theorem (Evans and Gariepy, 
[16], Theorem 1, Chapter I), we have for a.e. p G V{Q) : 



f 



£ Jt(p)-eJd{s) IIV/II 



dH ds 



f 



d{t(p)) IIV/II 



dH, 



as e \ 0. Thus, one only needs to prove the lemma for p G 'P(O) such that 
the above result holds. For convenience, let t = t^^ and 

tn = ■ It suffices 

to show that 



Vnh^ 
logn 







on the event An defined by 



An = \ sup \fn - f\< Tn], 
(t/2) 

where r„ = (log n)^ ^ / VnhF , because P{An) — > 1 (see the proof of Theorem 
2.1). According to Lemma 4.3, one has a.s. and for n large enough : (tn) U 
n{tn) C {t/2) on the event A^. Then, 



|A/((in)) - A/Jn(in))| 



(tn) 



fdX 



(tn) 



fndX 



< [ \fn-f\dX+[f 

J(t/2) J 



dX 



< Crn + cX(^{tn)An{tn)). (4.5) 

But, on An : 

A((t„)A„(t„)) < A({tn -rn<f<tn + r„}). 
By HI, H2, there exists a neighborhood V of t such that 

inf IIV/II > 0, 

thus, by Lemma 4.3, one has a.s. and for n large enough : 

x(^itn)Anitn)) < sup a({s - r„ < / < s + r„}) 



< crn, 
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where the latter inequahty is a consequence of Proposition A. According to 
(4.5), one has on A„ and for n large enough : 

|A/((tn)) - Xfim = |A/((tn)) - A/„(„(t„))| < cr„. 
Observe now that by Proposition A and our choice of t, one has a.s. : 

tn-t Jd{t) IIV/II 

thus on An, 

\tn - i| < cr„, 
for n large enough, hence the lemma • 

Lemma 4.6. Assume that H1-H4 hold and let (an)n be a sequence of posi- 
tive real numbers. If an ^ 0, a^nh^ / {\ogrif' — oo and nh^ / ilogri)^ — oo, 
then for a.e. p G V{Q) : 



Proof. According to Proposition A and HI, H2, H4, one has for a.e. t E & : 



dn, 



as £ \ 0. Hence, it suffices to prove the lemma for all p G 7^(0) such that the 
above result holds with t = t^P\ For convenience, let t = t^^ and t„ = tn^. 
By Lemma 4.5, one only needs to prove that 

1 1 /" 1 

Mn{tn) - n{tn + On)) = A( |t„ < fn < tn + On) ) ^ / ..^ ... dU, 

an ^ / q;„ VL jy IIV/II 

on the event Bn defined by 

Bn= { sup \ fn - f\< Vn, \tn - t\ < Vn] , 
(*/2) 



where Vn = logn/Vnh^ , because P{Bn) — > 1. But, for n large enough, one 
has ni'tn) U (t) C {t/2) on B„. Consequently, 



1 

an 



X{{tn <fn<tn + "n}) " \{{t <f<t + an}) 
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< —X({t - 2Vn <f<t + 2Vn}) < C — , 

and the latter term tends to by assumption on a„. Finally, the choice of t 
implies that 

—X({t <fn<t + an}) ^ uWJudn, 

(Xn J(t(p)) IIV/II 



SO that on S„ : 

1 /" 1 
X({tn < fn<tn + ttn}) ^ / , , TT^TFudn, 

hence the lemma • 

Proof of Corollary 2.1. According to Lemma 4.3, Lemma 4.6 and Theorem 
2.1, one only needs to prove that for a.e. p G ViQ) : 



0. 



Moreover, it suffices to show the above result for each p G 7^(S) such that 
the conclusion of Lemma 4.4 holds. Fix such a p E 7^(6) and, for simplicity, 
let t = t^P^ and 

tn — tn^. A straightforward computation gives the relation : 
Dn ■■= X{nitn)Ait)) " A(n(t)A(t)) = J (l{/„>t„} " l{f„>t})vdX, 

where 77 = 1 — 21{y>^}. Then, 

= j htn<fn<t}V dX- j l{t<f^<t„}ri dX. 
By (4.4) and H3, one can now apply Proposition A, which gives : 



is) l|v/„ 



rdH ds. 



Consequently, 



\Dn\ < / / -rr=r^dnds., 

Jmin(t„,t) Jd„{s) ||V/„|| 

SO that by Lemma 4.4 : 

V^Dn = V^[x(^n{tn)^{t)) " A(„(t)A(t 
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hence the corollary • 

Appendix : A change of variables formula. Proposition A below is a 
consequence of the change of variables formula given in Evans and Gariepy 
([16], Chapter III, Theorem 2). For a similar proof, see also Chapter III, 
Proposition 3 in the same book. 

Proposition A. Let ip : JR!^ — ^ IR+ be a continuously differentiable function 
such that (p{x) as \\x\\ — ^ oo, and I C iR+ be an interval such that 
inf / > and 



is integrable because f ^(/) is bounded. Proposition A is then an easy con- 
sequence of Theorem 2 in Evans and Gariepy ([16], Chapter III) • 
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Proof. Notice that is a locally Lipschitz function and 



5l<^-i(/) 
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